EQUATIONS IN SIMPLE MATRIX GROUPS: 
ALGEBRA, GEOMETRY, ARITHMETIC, DYNAMICS 



TATIANA BANDMAN, SHELLY GARION, BORIS KUNYAVSKII 

Abstract. We present a survey of results on word equations in simple groups, as well as their 
analogues and generalizations, which were obtained over the past decade using various methods, 
group-theoretic and coming from algebraic and arithmetic geometry, number theory, dynamical 
systems and computer algebra. Our focus is on interrelations of these machineries which led to 
numerous spectacular achievements, including solutions of several long-standing problems. 
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Once more a skald will other's song compose 
And utter it as of his own gift. 







OcHH MaH^nejibniTaM, 1914 






Osip Mandelstam, 1914 






(translated by Rafael Shusterovich) 




Contents 




1. 


Introduction 


2 


2. 


Main tools 


4 


2.1. 


Crucial assumptions 


4 


2.2. 


Lang- Weil inequality 


5 


2.3. 


Trace map 


5 


3. 


Equations in groups of Lie rank one and characterization of finite solvable groups 6 


4. 


Arithmetic dynamics 


10 


4.1. 


Verbal dynamical systems 


10 


4.2. 


Mapping tori of endomorphisms of free groups 


13 


5. 


Word maps: image and fibres 


14 


5.1. 


Borel's theorem 


14 


5.2. 


The image of the word map on finite simple groups 


15 


5.3. 


The fibres of the word map 


21 


6. 


Word maps on SL(2,g) and PSL(2,q) 


23 


6.1. 


Surjectivity for Engel words and some positive words 


23 


6.2. 


Criteria for equidistribution 


24 


7. 


Concluding remarks and open problems 


28 


References 


32 



2000 Mathematics Subject Classification. 14G05, 14G15, 20D06, 20D10, 20G40, 37P05, 37P25, 37P35, 37P55. 
Key words and plirases. matrix groups, finite simple groups, special linear group, word map, trace map, arithmetic 
dynamics, periodic points, finite fields, Lang-Weil estimate. 

1 



2 



TATIANA BANDMAN, SHELLY GARION, BORIS KUNYAVSKII 



1. Introduction 

Matrix equations, which in the most general form can be written as 

F(Ai,...,A„,Xi,...,Xd)=0, 

where Ai, . . . , Am arc some fixed matrices, Xi, . . . , are unknowns, F is an associative noncom- 
mutative polynomial, and the solutions must belong to a certain class of matrices, constitute a vast 
research domain, with spectacular applications well beyond algebra, say, in areas such as differential 
equations and mathematical physics. Noncommutativity brings lots of counter-intuitive phenom- 
ena, which can be observed even when asking deceptively simple questions about solvability of the 
equation or about the number of its solutions, even within the class of innocently looking quadratic 
matrix equations, see, e.g., [FS], [GRl], [GR2], [EGR], [CS], [Ge], [SI]. (Of course, the reader coming 
from differential equations and familiar, say, with Riccati matrix equations will not be too much 
surprised by the difficulty of the question.) 

To simplify the problem, one can limit oneself to considering equations of the form 

F{Xi,...,Xd)^A, (1.1) 

where A is the only allowed constant matrix, and only scalar constants are permitted to appear as 
coefficients of the polynomial F. Even in this limited form, the question about solvability of (1.1) is 
far from being settled though it has been extensively studied since the 1970's when Kaplansky asked 
about the existence of polynomials whose value sets consist of the scalar matrices. Such polynomials 
(called central) were discovered by Formanek [Fo] and Razmyslov [Ra]; naturally, if F is central 
and the matrix A is not scalar then equation (1.1) has no solutions. The same happens in the case 
where the value set of F consists of matrices with zero trace (say, when F is a Lie polynomial) and 
tr(^) 7^ (and, of course, in the trivial case where F is identically zero on the algebra M{n,K) 
of n X n- matrices with coefficients from the field K). However, there are subtler obstacles to the 
solvability of (1.1), see [KBMR] and references therein. Even a very special case where f is a Lie 
polynomial is not yet settled though some cases where (1.1) is solvable as well as some obstacles to 
solvability were discussed in [BGKP]. 

Further simplification is essential for the present survey: we focus our attention on the case where 
the solutions to the equations under consideration must belong to a certain multiplicative group of 
matrices. This naturally leads to the next modification: instead of considering polynomials F as in 
(1.1), which can be viewed as elements of the free associative algebra K (Xi, . . . , X^), we consider 
monomials w{xi,x^^ , . . . ,Xd,x'^^), which can be regarded as elements of the free d-generated group 
= {xit ■ ■ ,Xd). Thus our main object is the following equation (where we shorten our previous 
notation for w): 

w{xi,...,Xd) = g. (1.2) 
Here 5 is a fixed element of a group G, and we are looking for solutions among rf-tuples (51 , . . . , g^) 
of elements of G. By w{gi,...,gd) we understand the element of G obtained after substituting 
the gi instead of the Xi and performing all multiplications and inversions in G. We define w{G) = 
{w{gi, . . . , ga) : gi, . . . ,gd G G} as the set of values of w in G. Although equation (1.2) (henceforth 
called a word equation) makes sense in the case where G is not necessarily a group of matrices, we 
make emphasis on the matrix case. The reason is two- fold. First, main applications we are going to 
discuss refer to the cases where G has nice natural matrix representations. Second, our main goal is 
to stress the strength of algebraic-geometric methods in treating problems related to word equations. 
More specifically, our viewpoint can be described as follows: let us regard all matrix entries of the 
Xi as indeterminates, then, after clearing denominators arising because of determinants, we reduce 
equation (1.2) to a system of dn^ polynomial equations in variables over the ground field K (all 
matrices are assumed to be of size n x n). Thus we managed to go over from a noncommutative 
problem to a commutative one, but the paid price is high: the resulting system may be huge and 
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unaccessiblc to computer algebra systems. However, in certain cases such an approach may turn out 
to be fruitful, and several instances of successful implementation of this idea will follow. 

Once equation (1.2) is proclaimed as the main object of our investigation, the first natural question 
one should ask oneself is the following: what is the most natural class of groups G to start with? A 
strong hint is given by a theorem of Borel [Bo] (see also [La]), which states that if G is the group of 
rational points of a connected semisimple algebraic group defined over an infinite field, then for any 
w 7^ 1 equation (1.2) is solvable when its right-hand side is, roughly speaking, a "typical" clement 
g £ G; see Section 5.1 for a precise statement and some details. Thus the most substantial part 
of the present survey is devoted to the case where G is a finite simple group. Although in this 
case Borel's theorem docs not give any direct indication which elements should be thought of as 
"typical", we shall try to convince the reader that there are several ways out which make some 
sense. In particular, Larson [La] showed that for every nontrivial word w and e > there exists a 
number C{w,e) such that if G is a finite simple group with #G > C{w,e) then #w(G) > #G^~^, 
see Section 5.2. 

One should note that there is not much hope for positive results of Borel's flavour for groups G 
which arc too far from those singled out in his theorem; see the discussion in [BGKP, Section 5] 
and relevant references therein (to which one should add a recent paper by Myasnikov and Nikolaev 
[MN]). Therefore, as mentioned above, we restrict ourselves to considering the case where G runs 
within a family of finite non-abelian simple groups. Moreover, since most of the problems will be 
of asymptotic nature, we shall usually ignore the sporadic groups. Being even more restrictive, we 
focus our attention to the case of the family SL(2, q) (sometimes extending it to all groups of Lie 
rank 1, thus adding the family of Suzuki groups Sz((7)). This case is, on the one hand, typical 
enough, and sufficiently rich to have interesting applications. On the other hand, it allows one to 
use some additional efficient tools (such as trace maps) and get more precise results. Details can be 
found below, in Sections 3, 4 and 6. 

One can ask various questions on equation (1.2). First, one can fix a concrete group or a family of 
groups G (say, G = SL(2, q)), a concrete element g oi G (say, g = 1) and a concrete word w, and ask 
whether equation (1.2) is solvable (or has a non-identity solution, if 5 = 1 is chosen). Even in such a 
restrictive setting, some spectacular applications can be obtained, in particular, to a long-standing 
problem of characterization of finite solvable groups by recursive identities, see Section 3. A variation 
of this approach, where one proves solvability of a countable system of equations w„ = 1 (with G 
fixed as above), or, particularly, of the system Wm = Wn with Wi obtained after the i*^ iteration 
from certain initial data, can be reformulated in the language of periodic points. This viewpoint was 
(implicitly) taken in [BWW], where another sequence of words characterizing finite solvable groups 
was constructed, and made explicit in [BGKJ]. In the latter paper this machinery has been further 
developed which led naturally to some new concepts in arithmetical dynamics. Another impressive 
application of arithmetical dynamics to a hard group-theoretic problem has been demonstrated by 
Borisov and Sapir [BSl], [BS2], sec Section 4.2. 

Further, one can consider more general questions. First, still fixing some "interesting" word w, 
one may vary g G G and ask about the solvability of (1.2) for all g, or, at least, for "most of gr's". 
In the language of word maps, where we denote by 

w.G^^G 

the map given by (gi, . . . , g^) 1— > w{gi, . . . , gd), this means that we are asking whether the word map 
is surjective or, at least, has a "large" image. Ore's conjecture [Or] on representing every element 
of a finite simple group as a commutator is a fabulous instance of such a setting. Due to immense 
work spread over more than 50 years, it is now known that the commutator word w = [.x, y] G T2 
satisfies u'(G) = G for any finite non-abelian simple group G (see [LOSTl], the references therein 
and Section 5.2.2 below). It was therefore conjectured by Shalev that a similar result also holds 
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for iterated commutator (so-called Engel words). The study of this case started in [BGG] (for 
G = SL(2,g), see Section 6.1). The words of the form w = x°'y'' £ Ti have also attracted special 
interest. Larsen, Shalev and Tiep [LSTl] proved that any such word is surjective on sufficiently 
large finite simple groups. By further recent results of Guralnick and Malic [GM] and of Licbeck, 
O'Brien, Shalcv and Tiep [L0ST2], some words of the form Ou'y^ are known to be smjective on all 
finite simple groups. The particular case of surjectivity of these words on SL(2, q) was studied in 
[BG] (see Section 6.1). 

A ramification of this sort of questions, where one asks whether any element <? G G (or "most" 
of them) can be represented as a product of at most fc values of m; (A; is a fixed natural number), 
has been christened "a Waring-type problem" by Shalev (by analogy to a celebrated problem on 
representing a natural number as a sum of k powers), and a number of impressive results has been 
obtained. The most conclusive one, due to Larsen, Shalev and Tiep, says that for every nontrivial 
word w there exists a constant C(w) such that if G is a finite simple group satisfying #G > C{vi) 
then w(Gf = G, see [Sh2], [LaS2], [LSTl], [LST2] and Section 5.2.1 below. 

Note that a naive question whether w{G) = G for any nontrivial word w and all sufficiently large 
finite simple non-abelian groups G is clearly answered in the negative: indeed, it is easy to sec that 
if G is a finite group and m is an integer which is not relatively prime to the order of G then for the 
word w = x™ one has w{G) ^ G. Hence, if w e Jd is any word, then the word map corresponding 
to w = cannot be surjective. A natural question, suggested by Shalev, is whether these words 
are generally the only exceptions for surjectivity of word maps in finite non-abelian simple groups. 

Yet another type of problems, first introduced in [GS], arises when one asks about the behaviour 
of the fibres of the word map rather than about its image. Namely, Shalev asked [Shi, Problem 
2.10] whether (the cardinalities of) these fibres are equidistributed (or close to be equidistributed) 
when g varies in some "large" subset of the image of w and G runs over some family of finite groups. 
It was proved in [GS] that the word w = x'^y'^ G J^2, the commutator word w = [x,y] G J^2, sts well 
as the words w = [.xi, . . . , Xd] G Td, cZ-fold commutators in any arrangement of brackets, are almost 
equidistributed on the family of finite simple non-abelian groups. The case G = SL(2, q) was studied 
in some more detail in [BGG], [BG], [BK], see Section 6. 

We do not pretend that the present overview of word maps is comprehensive. As one can see from 

the title, shamelessly plagiarized from Manin's book [Man], it was conceived for emphasizing the 
role of various tools lying outside group theory, particularly, of algebraic-geometric and arithmetic- 
geometric nature, in solving hard group-theoretic problems (note, however, that the role of the 
"algebra-geometry-arithmetic" triad in Manin's setting was somewhat different: his focus was on 
systematic use of algebraic, mainly Galois-cohomological, methods in relating arithmetic phenomena, 
such as counter-examples to local-global principles, to geometric ones, such as non-rationality; our 
viewpoint is different: a typical target is a group-theoretic question whereas algebraic and arithmetic 
geometry provide an efficient machinery) . The reader inclined to purely algebraic approaches to word 
equations is referred to an excellent exposition given in a monograph by Segal [Seg] and a more 
recent survey by Nikolov [Ni]. Vast literature on equations and system of equations in free (and 
close to free) groups is left aside; see, e.g., [CRK], particularly the introduction, for a comprehensive 
bibliographical survey of this theory. 

2. Main tools 

In this section we shall briefly describe some common machinery used while treating word matrix 
equations, leaving details, which may vary from one problem to another, for subsequent relevant 
sections (note, in particular, that the problems discussed in Section 5 require lots of other techniques). 

2.1. Crucial assumptions. Our general setting is as follows. 
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(1) We view a matrix group G as an algebraic variety G C ^ embedded into an affine 
space, where the entries of a matrix v £ G play role of its coordinates {xx{v), . . . , Xn{v)) = 
{vi, . . . ,vn) (though this is not the case if G is a Suzuki group, G then has a "large" subset 
meeting this condition; see Section 3). 

(2) Group product (ui,U2) !—>■« = viV2 is a polynomial morphism G x G ^ G. 

Changing N if necessary, we can guarantee that the inversion will also be a polynomial map; in 
most cases, however, this is not needed because usually we work with matrices of determinant 1. 

Let d > 2 be an integer, and let w = 'w{xi, . . . ,Xd,) be a nontrivial element of the free group 

J^d = {xi, . . . , Xd) on d letters xi. This means that it is a reduced word in Xi and x'^^ with nonzero 
exponents. Then, given a group H, the word w defines a map : H'^ — >■ H, 

fw{hi,...,hd) =w{hi,...,hd). 
When no confusion may arise, we will shorten to w. 

Provided (1) and (2), we may interpret an endomorphism of a group as a polynomial morphism 
G ^ G and a word w{x\, . . . ,Xd) as a morphism G'^ G. Similarly, for a word map w. G'^ ^ G and 
an element g G G, a. solution of the equation w{xi, . . . , Xd) = g corresponds to an affine subvariety 

S{w,g)cG''. 

If G is a finite group (and this is the case in most applications considered in the present survey), 
we can choose the ground field to be a finite field ¥. If the problem under consideration requires a 
study of a family of finite matrix groups Gq, each defined over its own ground field Fg, it is convenient 
to view Gq as a fibre of a group Z-scheme Q. Then solutions to the equation w{x\, . . . ,Xd) = g are 
described by points S{w, g){¥q). 

This leads to the following local-global approach: instead of solving equations in an infinite family 
of groups Gq, one has to study a single Z-scheme S{w,g) and then look at its fibres at every q. 

2.2. Lang— Weil inequality. Once the initial group-theoretic problem is reduced to an arithmetic 
geometry problem of proving the existence of a rational point on a variety defined over a finite 
field, a natural tool to use is the Lang Weil inequality [LW] . It says that if X is an n-dimcnsional 
absolutely irreducible variety over ¥q, then asymptotically (i.e., for q large enough), the number 
of its rational points #X{¥q) does not differ too much from the number #P"(Fg) of points of 
the n-dimensional projective space (which is q" + q"~^ + ■ ■ ■ + q + 1). Namely, the difference is 

one can make this inequality effective, i.e., compute or 
at least estimate Ci and C2, one can guarantee the existence of an Fg-point on X for q large 
enough. Note that such a computation is related to deep topological properties of X and requires 
some information on its Betti numbers (in suitable cohomology), see [GL2] for a nice survey of 
various ramifications and generalizations of the Lang- Weil inequality. In the one-dimensional case 
the classical Weil estimate gives an expression of the remainder term via the genus of the curve. 

One should not forget another difficulty related to checking absolute irreducibility of X, which may 
be a highly nontrivial task in concrete examples. Most of arising problems comprise computational 
aspects, and usually require some advanced computer algebra to overcome them, see Section 3. 

2.3. Trace map. In the case where G = SL(2), apart from the general techniques described in the 

previous section, we have at our disposal another powerful tool going back to classical works of Vogt, 
Fricke and Klein [Vo], [Fr], [FK], cited here from the paper [Ho] (see also [Magi], [Mag2], [Gol] for 
a nice exposition of these results). 

Let Td = {xi,...,Xd) denote the free group on d generators. For G = SL(2, fc) (fc is any 
commutative ring with 1) and for any u € J^d denote by tr(M) : G'^ ^ G the trace character, 
i9i,---,gd) ^ tr{u{gi,...,gd)). 
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Theorem 2.1. If u is an arbitrary element of J^d, then the character of u can be expressed as a 
polynomial 

tv{u) = P{ti, ...,td, ti2, ti2...d) 

with integer coefficients in the 2*^—1 characters ti^i^..^^ = iT:{xi^Xi^ . . - Xi^), l<v<d, l<ii< 
i2 < ■ ■ ■ < iv < d. 

Let G = SL(2, q), and let tt: G"^ ^ A^'-i be defined by 

T^igi,- ■■,9d) = {h,- ■ ■,td,tl2, ■ ■ ■,tl2...d) 

in the notation of Theorem 2.1. 

Let Zd := niC^) C Let w: ^ G he a word map. It follows from Theorem 2.1 that for 

every d there exists a polynomial map tp: A"^ ~^ ^ A^ such that the following diagram commutes: 

G'i G 



Zd{V,) Ai. 

Moreover, for small d we have a more precise information: one can take Z2 = A^ and Z^ C A^ an 
explicitly given hypersurface. This diagram allows one to reduce the study of the image and fibres 
of w to the corresponding problems for tj), which may be much simpler, see Sections 4.1.3 and 4.1.4. 

3. Equations in groups of Lie rank one and characterization of finite solvable 

GROUPS 

As mentioned in the introduction, even a solution of only one word equation in a small family of 
finite groups may lead to spectacular consequences. Here is an instance of such a phenomenon. 

We consider a problem of characterizing various classes of groups by identical laws. Say, G is 
abelian if and only if \x, y] = 1 for all .x, y E G; a. finite group G is nilpotent if and only if 

(3n = n(G)) en{x,y) := [[x,y],^ . . ,y] = 1 

n times 

(Zorn's theorem [Zo]); what about finite solvable groups? 

Two-variable sequences. The purpose is to establish a characterization of the solvable groups in the 
class of finite groups by an inductively defined, Engel-like sequence of two- variable identities. This 
problem has a long history and admits many counterparts and generalizations; the interested reader 
is referred to [GKP] for a survey. 

Definition 3.1. Let w & T2 and / G J^3. 

(i) We say that a two- variable sequence {vn{x, y)} is defined by the recursive law {w, f) if 

vi{x,y) = w{x,y), Vn+iix,y) = f{x,y,Vn{x,y)). 

(ii) Wc say that the sequence is fc-valent {k = 1,2,3) if / = f{x,y,z) depends on k variables 
among which z must appear. 

(iii) We say that the sequence is Engel-like if f{x, y, 1) = 1 in 

(iv) We say that the sequence {w„} characterizes finite solvable groups if the following holds: 

A finite group G is solvable if and only if 

(3 n e N) (V(,r,y) G G x G) Vn{x,y) = 1. 

Remark 3.2. If {vn} is an Engel-like sequence, the condition Vn{x, y) — 1 implies that Vm{x, y) = 1 
for all m> n. 
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Example 3.3. The original Engcl sequence eorresponds to w{x,y) = [x.y]. f{x,y.z) = [z.y]. It is 
2-valent and Engel-like in the sense of the previous definition but, of course, does not characterize 
finite solvable groups. 

The first example of a sequence satisfying the conditions of Definition 3.1 was found in [BGGKPPl]. 
Let the commutator be defined as [a, 6] := aha~^h~^. Set 

w{x,y) =x~'^y~'^x, f{x,y,z) = [xzx~'^ ,yzy~'^]. 

The corresponding sequence is 

ui{x,y) := x~'^y~'^x, Un+i{x,y) := [xun{x,y)x~'^, yun{x,y)y~'^]. (3.1) 

Theorem 3.4. [BGGKPPl], [BGGKPP2] A finite group G is solvable if and only if for some n we 
have Un{x, y) = 1 for all x,y & G. 

This section is a brief exposition of the proof of this theorem. It involves algebraic geometry, 
arithmetic geometry, algebra and heavy MAGMA and SINGULAR computations. 

Remark 3.5. The role of the first word is very important: for example, if one changes the first word 
to a more natural one: ui{x, y) = w{x, y) = [x, y], keeping the same recursive law /, we do not know 
whether or not the obtained sequence characterizes finite solvable groups. Moreover, neither do we 
know this for the prototypical sequence arising from ui{x, y) = [x, y], f{x, y, z) = [[z, x], [z, y]], which 
characterizes finite-dimensional solvable Lie algebras (if [, ] is understood as Lie bracket) [GKNP]. 

More 3-valent sequences {un} for which Theorem 3.4 holds were produced in [Ri]. We conjec- 
ture after long computer experiments that for most (if not for any) choices of / satisfying obvious 
necessary conditions, there is an initial word Ui such that Theorem 3.4 holds; see the discussion in 
Section 4.1.4 and Conjecture 7.1 below. 

An example of a 2-valent sequence which still characterizes finite solvable groups was given in 
[BWW]: 

si{x,y):=x, Sn+i{x,y) ■.= [ysn{x,y)y~'^,sn{x,y)~^]. (3.2) 

Theorem 3.6. [BWW] A finite group G is solvable if and only if for some n we have Sn{x,y) = 1 
for all x,y G G. 

Whereas the proof of Theorem 3.4 is based on solvability of a single word equation in a family 
of simple groups (see the next section), the proof of Theorem 3.6 requires solvability of a system of 
countably many equations and has an evident dynamical fiavour, see Section 4.1. 

Equations in SL(2) and Suzuki groups. In this section we sketch a proof of Theorem 3.4. 

Clearly in every solvable group the identities Un{x,y) = 1 are satisfied from a certain n G N 
onward. By a standard argument, using minimal counter-examples, the nontrivial direction of 

Theorem 3.4 follows from 

Theorem 3.7. Let G be a finite non-abelian simple group. Then there are x, y G G such that 

ui{x,y) = U2(x,y), ui{x,y)y^l. (3.3) 

Moreover, the same argument shows that we only need to prove Theorem 3.7 for the groups G 
in J. Thompson's list of the minimal simple non-solvable groups [ThJ]. For simplicity we slightly 
extend it: 

(1) G = PSL(2, q) where q>4:{q=p"',p prime), 

(2) G = Sz(22™+i), m G N, 

(3) G = PSL(3,3), 
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because this does not make our proof more complicated. 

Here Sz(22™+i) (m G N) denotes the Suzuki groups (see [HB, XI.3]). 

For small groups from this list it is an easy computer exercise to verify Theorem 3.7. There are 
for example altogether 44928 suitable pairs x.y in the group PSL(3,3). 

The general idea of the proof is as described in Section 2. For a group G in the above list, 
using a matrix representation over F,, we interpret solutions of the equation ui{x,y) = U2{x,y) as 
Fq-rational points of an algebraic variety. Estimates of Lang Weil type for the number of rational 
points on a variety defined over a finite field guarantee in appropriate circumstances the existence 
of such points for big q. Of course we are faced here with the extra difficulty of having to ensure 
that ui{x,y) ^ 1 holds. This is achieved by taking x,y from appropriate Zariski-open subsets only. 

The case G = PSL(2, q). Let us look for a solution to (3.3) among the matrices of the form 



x{v) 







, y{v) 



b 

1 + bc 



where v = {t, b, c) G A^. 

SINGULAR computations show that the equation ui{x{v),y{v)) = U2{x{v),y{v)) defines an al- 
gebraic curve C C A^. Note that a priori one should expect dimC — because C is defined by 3 
equations in 3 variables. It is this dimension jump that forced a somewhat peculiar choice of the 
initial word Ui{x, y). 

It turns out that 

• C does not depend on F,; 

• C is absolutely irreducible for any p; 

• Pa{C) = lOidegC = 12 (where C denotes the projective closure of C, Pa is the arithmetic 

genus and deg is the degree). 

(The second and third statements were established by computer calculations; the proof of absolute 
irreducibility is technically the most difficult part; the interested readers are referred to the original 

papers.) 

From Weil's estimate for the number of rational points over a finite field it follows that C{¥q) > 
q+1 — 2pa^ — d, hence for q > 593 there are enough rational points on C to prove Theorem 3.7. 
Solutions for (3.3) in the groups G = PSL(2, q), q < 593, were found by computer. 

The case of Suzuki groups. In the proof of Theorem 3.7 the case of Suzuki groups G = Sz{q) 
(gr = 22™+!) is the most diflacult one. 

The main reason is that although Sz(g) is contained in GL(4, q), it is not a fibre of a Z-scheme. 

In fact the group Sz{q) is defined with the help of a field automorphism of ¥q ("square root 
of Frobenius", see, e.g., [Hu, Chapter 20] for a precise definition), and hence the standard matrix 
representation for Sz{q), obtained in the original paper of Suzuki [Su], contains entries depending 
on q. We shall describe now how our problem can still be treated by methods of algebraic geometry. 

This time one looks for a solution of (3.3) among the matrices parameterized by points of an 



8-dimensional space: for v 



(a, b, c, d, ao, bo, cq, do) G A^, let 

a^ao + ab + bo 
aao + b 
a 
1 



x{v) = 



b 

1 




c^co + cd+ do d c 1 

ccq + d Co 1 

c 10 

1 
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Moreover, x{v) G Sz{q) if and only if 

ao = a^'"^\bo = b^'"^\ (3.4) 

Similarly, y{v) G Sz{q) if and only if 

Co = c^""^\do = rf^™"^'. (3.5) 

Define T, := {v \ ui{x{v),y{v)) = U2{x{v),y{v))} C A^. 
From relations (3.4) and (3.5) it follows that 

S (Fg) - S n {ao = a'" , fco = , co = c^" , do = d'" } • 

In order to prove that the set S(Fg) is not empty, it is represented as the set of fixed points of 
an automorphism of S. 

Namely, consider an automorphism a: ^ defined by 

a{a, b,c,d,ao,bo,co,do) = {al,bl,cl,dl,a,b,c,d). 

Then is the Frobcnius automorphism. Moreover, from (3.4) and (3.5) it follows that x{v),y{v) G 
Sz{q) if and only if a™(w) = v, i.e., if and only if v is a fixed point of the m*'* iteration of a. 
Computations show that there exists a subset f7 C S such that 

(i) : ?7 is a smooth, absolutely irreducible, affine variety, 

(ii) : dim;7 = 2, 

(iii) : U is ct-invariant. 

Denote by V{U) = dimHi^.{U,Qi) the ^-adic Betti number (^ ^ 2). Then 
Proposition 3.8. b^{U) < 675 and b'^{U) < 2^2. 

The estimates contained in Proposition 3.8 are derived from results of Adolphson-Sperber [AS] 

and Ghorpadc Lachaud [GLl] allowing one to bound the Eulcr characteristic of an affine variety in 
terms of the number of variables, the number of defining polynomials and their degrees. Note that 
since U is affine, we have b^{U) = b^{U) = 0. Since U is nonsingular, the ordinary and compact 
Betti numbers of U are related by the Poincarc duality, and we have bl{U) = &^~'(t/). 

In order to estimate the number # Fix([/, n) of Q:"-fixed points in U, applied was the Lefschetz 
Trace Formula: 

4 

#Fix(C/,n) = ^(-l)nr(a" | Hi{U,Qe)) (3.6) 

i=0 

where Fix(C/, n) is the set of fixed points of a" acting on U. 

(A remark for the reader interested in algebraic-geometric details: technical difficulties arising 
from the fact that U is not projective and a is not Frobenius, were overcome, roughly, as follows: a 
can be extended to an endomorphism of having no fixed points in U\U, where U is the projective 
closure of U. Then one can use Dcligne's conjecture stating that a formula of Lefschetz type holds 
after composing a with a suflSciently high power of Frobenius, which, in our case, means a high 
odd power of a. Recall that Deligne's conjecture has been proved by Fujiwara [Fu]; see [Va] for 
simplifications and generalizations.) 

From formula (3.6) and Deligne's estimates for the eigenvalues of the endomorphism induced by 
a on the etale cohomology, an inequality of Lang-Weil type follows: 

|# Fix(C/, n) - 2"| < b\U) 2^"/'' + b^{U) 2"/^. 

An easy estimate shows that #Fix(t/, n) ^ for n > 48. The cases n < 48 were checked with the 
help of MAGMA. 
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4. Arithmetic dynamics 

In this section wc; present two group-theoretic problems for which the language of arithmetic 
dynamics appears to be an adequate one. The first one, discussed in Section 4.1, arose from attempts 
to understand the proof of Bray, J. Wilson, R. Wilson [BWW], who exhibited another sequence 
characterizing finite solvable groups (see Section 3), find an explanation of the phenomenon that a 
sequence possesses this property, and produce more such sequences. This was essentially done in 
[BGKJ], where the interested reader can find more elaborated constructions in arithmetic dynamics 
which are beyond the scope of the present survey. 

Another instance is related to the work of Borisov and Sapir [BSl], [BS2], see Section 4.2, where 
somewhat similar philosophy led to an answer to another long-standing group-theoretic question 
(this time, from the theory of infinite groups). 

4.1. Verbal dynamical systems. Given a two-variable sequence {u„(.T,y)} defined by a recursive 

law (w.f) (see Definition 3.1), one can ask whether or not it characterizes finite solvable groups. 
An obvious necessary condition is that {vn} must descend along the derived series, and we always 
assume that this condition holds. We also assume that {u„} does not contain the identity word. 
Then the only condition to check is the following one: 

G is not solvable {\/n){3{x,y) € G x G) : Vn{x,y) ^ 1. 

This condition may be further reduced (see Section 3) to the following property. 

Property 4.1. Let G be one of the groups PSL{2, q){q ^ 2,3), Sz(22™+i), or PSL(3,3). Then 

{yn){3{x,y)€GxG):Vn{x,y)j^l. 

Since it is very easy to check the single case PSL(3,3), we assume throughout below that the 
property holds for this group. 

We will say that a sequence is very good if Property 4.1 holds for all groups listed therein and is 
good if it holds at least for all PSL(2, q){q ^ 2, 3). In this section we want to describe good sequences. 

For a further simplification, the following observation is crucial: if is an Engel-like sequence 
then Property 4.1 holds in a group G as soon as 

(3n)(3rn > n){3{x, y) £ G x G) : Vn{x, y) = ^^(a;, y) ^ 1. 

This simple reformulation allows one to replace the proof of solvability of a concrete equation (say, 
vi{x,y) = V2{x,y) ^ 1) in a family of groups, as was done in Section 3, by the proof of existence of a 
preperiodic point (different from identity) of a certain dynamical system generated by the recursive 
law /. 

Recall that a sequence Vn{x, y) = {w{x, y), f{x, y, z)), generated by the first word w and recursive 
law /, is formed by the rule 

vi{x,y) = w{x,y), Vn+i{x,y) = f{x,y,Vn{x,y)). 

In such a situation, one can define, for any group G, a self-map GxGxG ^ GxGxGhy adding 
"tautological" variables. We arrive at the notion of verbal dynamical system {Q, (j), V) consisting of 

the following data: 

• a group scheme Q (in our case Q = SL(2,Z)); 

• a morphism (j): Q x Q x Q ^ Q induced by the word f{x, y, z); 

• an endomorphism (f>: QxQxQ^QxQxQ defined by ^{x,y,z) = {x,y,<p{x,y, z)); 

• a forbidden set V (in our case V = Q x G x {id}); 

• an initial word w: G x Q ^ Q. 
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Conversely, given such data, we reconstruct our iterative sequence {vnix,y)}. 
For each field F,, we consider the fibre (SL(2, q), 0g, V{¥q)). The sequence {vn} is good if for every 
q there are 0q-preperiodic points outside V{¥q). 
Consider three possible types of /. 

4.1.1. 1-valent law. The only 1-valent law is f{v) = v^. Then Vn+i = Wi". This sequence is not going 
down along the derived series and hence does not meet the necessary condition for characterizing 
solvable groups. 

4.1.2. PSL(2), 2-valent law. An example of a 2-valent law is sequence (3.2). In such a case our 
general setting can be simplified. Namely, if / = f{y, z) does not depend on x, we can restrict our 
verbal dynamical system to the form (j): QxQ^QxQ, {y, z) i-^ {y, f{y, z)), with the forbidden set 
G X {id} and initial word w{x,y) = x. 

4.1.3. Traces. In order to further simplify the dynamical system, one can use the trace map (see Sec- 
tion 2.3). In the special case d = 2, Theorem 2.1 may be formulated as follows. 

Theorem 4.2. Let G ^ SL(2,Z). Define n: GxG^ byTT{x,y) = (tr(a;), tr(a;y), tr(y)). Then for 
a word map (f): G x G ^ G, there is a polynomial in three variables P,/,(s, u, t) such that tr((^(a;, y)) = 

P4tT{x)Mxy)My))- 

Denote s = tr(a;),w = tr(a;y),t = tr(t/). 

Let /i(s, u, t) ^ tr((/)(x, y)), ji{s, u, t) = tr{(f){x, y)y), and V = (/i(s, u, t), /2(s, u, t),t). 
According to Theorem 4.2, we have the following factorization of the verbal dynamical system 
{Q,4>,V), i.e., the following commutative diagram: 

gxg A gxg 

■Ki i-K (4.1) 




In diagram (4.1), 

(1) TT is defined over Z; 

(2) TT is surjective for all Fg (see, e.g., [Mac] or [BGKJ]); 

(3) the set S of fixed points of being defined by the system 

/i(s,u,i) = s, f2{s,u,t)=u, 

has a positive dimension. 
Respectively, for every q we have a commutative diagram: 

SL(2, q) X SL(2, q) SL(2, q) x SL(2, q) 

TT X 4- 

Assume that S(Fg) contains an irreducible over Fg curve C of genus g and degree d that intersects 
7r(y) at k points. Then by Weil's inequality 

#(C \ 7r{V)){¥q) >{q+l)- 2g^ -d-k, 

and (C \ TT{V)){¥q) 7^ for g big enough, q > qoig,d,k). Let a € (C \ 7r(y))(F,) and Fa = 
7r~^(a) C SL(2,q) x SL{2,q) \ Vq. Then Fa is a (^^-invariant finite set and thus contains a nontrivial 
^g-preperiodic point. 
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Wc can now formulate a procedure for checking that a 2-valent sequence is good. Namely, 
assume that {u„} is defined by a word (law) /(y, z). The process consists of the following steps: 

(1) : compute the trace map ijj of the endomorphism <p; 

(2) : compute the set S of fixed points of ip; 

(3) : find an afhne curve 5 C E such that 

(i) : S is defined over Z; 

(ii) : S is irreducible over Q; 

(iii) : S is not contained in 7r(V). 

If the process succeeds, then one has to check finitely many remaining cases, namely, 

• for finitely many primes pi, . . . ,Ps one has to check whether C is irreducible over ¥p. (cases 
of bad reduction); 

• for all small fields F, with q < qo{g, d, k) one has to find a nontrivial ^-preperiodic point in 



All these steps can be performed by computer. 

Example 4.3. The process was performed for the sequence s„. The curve S contains a line in Tr{V) 
and two curves, each of genus 1 with 3 punctures. Thus, each of these two curves has a point over 
every field Vg, q> 9. 

Example 4.4. Using this process, it was proved that the 2-valent sequence 

ri{x,y) = X, rn+i{x,y) = [y'^xy~^,x~^] 

is good as well. Moreover, it was checked for the Suzuki groups too. Thus it characterizes finite 

solvable groups. 

4.1.4. 3-valent sequences. An example of a 3-valent sequence characterizing finite solvable groups is 
sequence (3.1). Consider any 3-valent sequence Vn = {w,f). As explained above, it gives rise to a 
morphism (f>: ^ G and an endomorphism (f): G^ G^, 4>{x,y,z) = {x.y, f{x,y, z)). 

The sequence is good if for every q there exists m = m{q) such that there is a solution in 
SL(2, q) X SL(2, q) to the equation 



where Vg = SL(2, q) x SL(2, q) x {id} denotes the forbidden set. 

As in Theorem 2.1, we express the trace of (j){x,y,z) as a polynomial in 7 variables ai = tr(x), 
^2 = tr(y), as = ti{z), a\2 = tr{xy), ais = tr{xz), 023 = tr{yz), 0123 = tr{xyz). These variables are 
dependent (see, e.g., [Magi] or formulas (2.3)-(2.5) in [Ho]): 



+ (tti + 02 + as + ai2 + flis + a23 - 0102012 - 0103013 - 02O3O23 + 012O13O23 - 4) = 0. 

Let a = (01,02,03,012,013,023,0123) € A'^. Let -!T{x,u,y) =aGZhe the trace projection. Let 
Z = n{Q^) C A^. Then we have a commutative diagram 



SL(2,g) x SL(2,g). 



vi{x,y) = Vm{x,y) ^ 1. 
This means that there is a pair {x, y) G SL(2, q) x SL(2, q) such that 

4>"'{x, y, w{x, y)) = {x, y, w{x, y)) Vg, 



ai23 - ^123(01203 + 01302 + O23O1 - O1O2O3) 



GxGxG 



^ GxGxG 



TT 



TT 



(4.2) 



Z 



Z 
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where = {ai,a2,h{a),ai2,h{a),l3{a),l4{a)), 



I3 = tT{(p{x, y, z)y), I4 = tr((/5(.x, y, z)xy). 



In [BGKJ] it is proved that Z is an irreducible hypersurface over any algebraically closed field 
and that tt is surjective for every F^. 

The dimension of the variety F{(f) C Z oi fixed points of tp is at least 3. The additional condition 
z = w{x,y) defines a 3-diniensional affine subset W{w) C Z. 

The sequence is good if {{F{ip)f]W{w)) \ V){¥g) ^ for all q big enough. Since dimZ = 6, 
dim F{(p) = 3, dim W{w) = 3, one should expect that {{F{(p) f] W{w)) is zero-dimensional. However, 
it turns out that for the sequence u„ defined by (3.1) it is an absolutely irreducible curve! Thus we 
can formulate a sufficient condition for a 3-valent sequence to be good. 

Theorem 4.5. Let w,f define a sequence {vn{x,y)}. Let F{ip) be the variety of fi,xed points of the 
trace map ijj of the corresponding endom,orphism, (p {see diagram (4.2)), and let W{w) he defined by 
w. Let V = {a2 = 2, ai = 012, as = 023, ai3 = 1123}- Assume that F{(p) f] W{w) contains a positive 
dimensional, absolutely irreducible Q-subvariety $ ^ y. Then there is qo such that for every q > qo 
there exists a (j)-preperiodic point in SL(2, g)^ \ Vq. 

Once again, this theorem provides a finite process, which may be performed by computer, deter- 
mining whether a sequence is good. 

It is a conceptual challenge to understand whether or not the property of a sequence to characterize 
finite solvable groups is generic in some reasonable sense. We suspect that this question can be 
answered in the affirmative, in the sense that for almost every law f(x,y,z), satisfying necessary 
conditions, there exists a first word w{x, y) such that the resulting sequence is as required. Of course, 
one has to make more precise what is meant by "almost every law". See Conjecture 7.1 below. 

4.2. Mapping tori of endomorphisms of free groups. Below we describe in brief another 
spectacular application of arithmetic dynamics to group theory, following [BSl]. 

Given a group G with generators xi, . . . ,Xd and a relation set i?, and its injective endomorphism 
(j) taking Xi to a word wi {i = 1, . . . ,d), the mapping torus T of is defined as the group extension 
of G obtained by adding a generator t subject to the relations txit~^ = Wi. One can ask whether T 
is residually finite (this property means that the intersection of the subgroups of finite index of T is 
trivial). We refer to [Ka] and [BSl] for the history and context of this problem. 

The following theorem answers this question in affirmative in the case where the set R is empty, 
i.e., G is a free group. 

Theorem 4.1. The mapping torus of any injective endomorphism of a free group is residually finite. 
Here are the main steps of the proof. 

• By the definition of the mapping torus of an injective endomorphism, it is enough to prove 
that for any w & Td and any positive integer a there is a homomorphism h: T ^ H to a, 
finite group H such that 



• Any homomorphism hg of Td to SL(2) is defined by a point g = {gi, . . . ,gd) € SL(2)''. Then 




(4.3) 



hg{xi) = gi, hg{w{xi, . . ■,Xd)) = w{gi,. . . , 



9d)- 



• Define «>: SL(2)'' ^ SL(2)'^ by 



$(S1, ...,Sd) = {Wi{si, . . .,Sd), ■ ■ ■,Wd{si, . . ■,Sd)). 



Then 



/i,(</.(«)(«;))=^$W(5i,...,5<i)). 



(4.4) 
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• Take a field big enough and consider the endomorphism $g : SL(2, qY SL(2, q^. Then 
(4.4) is still valid. 

• Thus any point g G SL{2,q)''' such that 

— it is periodic for $q, 

— T^w{g) ■■= w{g) = w{gi,. ..,ga) j^l, 
will have property (4.3). 

• Let y C SL(2)'' be the Zariski closure of $(4'*)(SL(2)'^). Then 

— V is ^-invariant, 

— $ : V is dominant, 

— ^„(y) ^{id,-id}. 

• Let Z = V \ 7r^^({id. — id}) and fix ¥q. By a theorem of Hrushovski [Hr, Cor. 1.2], there is 
an extension ¥q^ of and a $gj-periodic point g € Z(FgJ. 

In order to avoid Hrushovski's theorem, the authors prove its particular case for endomorphisms 

of afRne space. That is why they consider SL(2) as a subscheme of the scheme M2 of all 2 x 2 
matrices. Note, however, that using Hrushovski's theorem, they obtain a stronger result where 
residual finiteness is established for the mapping torus of any endomorphism of any finitely generated 
linear group. Moreover, a further refinement of their method in [BS2], involving, in particular, search 
of periodic points of self-maps defined over p-adic fields, allows one to get more precise information 
on the structure of the mapping tori. 

Remark 4.6. Note that in the framework of the dynamical approach presented above it is permitted 

to look for periodic points defined over some extension of the original ground field. This is a subtle 
but important difference from the method described in Section 4.1 where such an extension is 
forbidden, which prevents from using Hrushovski's theorem. 

5. Word maps: image and fibres 
In this section we focus on the case of finite simple groups asking the following questions: 

(1) How big is the image of a word map? 

(2) What are the sizes of the fibres of a word map? 

These questions are interrelated: one can prove that the image is large by estimating the sizes of 
fibres. 

We start, however, with recalling a seminal result by Borel [Bo] where a general answer to the 
first question was obtained for connected semisimple algebraic groups. 

5.1. Borel's theorem. 

Theorem 5.1. [Bo]. Let G be a connected semisimple algebraic group defined over a field K. Let 

fu: '■ G'^ G be the map associated to a nontrivial element w of the free group on d > 2 letters. 
Then fw is dominant, i.e., its image is Zariski dense in G. 

Here are the main steps of the proof. 

(1) Since dominance is preserved by field extension, one may assume that K is algebraically 
closed of arbitrary transcendence degree. 

(2) One can show that the assertion of the theorem docs not depend on the choice of G within 
its isogeny class, so one may assume G is simply connected. One can also easily reduce to 
the case where G is simple. 

(3) There is a d-tuple {gi, . . . ,5^) e G'^ such that fwidi, - ■ ■ ,9d) 1 (because a simple group 
has no identical relations). 

(4) First consider the case G = SL„. 
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• The following observation, going baek to the unitary trick of Weyl and used in [DS], is 
crucial: one can find a subfield L C K and a division L-algebra D so that the group 
G = SL„ contains its anisotropic form H = SLi(Z)), the group of elements of D of 
reduced norm 1, as a dense subset. 

• One proves that 

if id 7^ ft, G iJ, then 1 is not an eigenvalue of h. (5-1) 

• Let imfyj = Z be the closure of im/^,. One should prove that Z = SL{n,K). Since H 
is dense in G, 

{0}y^U{H'^)cZnH. (5.2) 

• Use induction on n. If n = 2 and Z ^ G, then dim Z < 2 and Z is a union of a 
finite number of conjugacy classes of (non-identity) semisimple elements and the set U 
consisting of unipotents. Since id G Z and Z is irreducible, it is contained in U, which 
contradicts (5.1). Hence, for n = 2 the statement is valid. 

• Assume that the statement is proved for n < m— 1. Fix a maximal torus T C SL(m, K). 
Let T' c T be the union of subtori consisting of elements with at least one eigenvalue 
1. T' is a hypersurface in T. By the induction hypothesis, Z D T'. On the other hand, 
by (5.2), there is a d-tuple {hi,. . . , ha) € H''^ such that /«,(fti, . . . , hd) G T\T'. Since 
SL{ni,KY is an irreducible variety, Z should be irreducible as well. Therefore, if it 
contains a hypersurface T' and at least one point outside T', it contains T. Thus, 
Z = SL{m,K) since the conjugates of T are dense in SL{m,K) and im/^ is invariant 
under conjugation. 

(5) Any simple group of rank r not isogenous to SL„ contains a subgroup of the same rank 
r which is isogenous to a direct product of groups SL„ . , and the assertion of the theorem 
follows from the previous step. 

Remark 5.2. See [KBKP] for an alternative proof based on Amitsur's theorem on generic division 
rings [Am]. 

5.2. The image of the word map on finite simple groups. From now on G is a finite simple 
group, w{xi, . . . ,Xd) G is a nontrivial word, and we shorten our previous notation so that 
w: G'^ ^ G denotes the corresponding word map. 

It turns out that one can provide an analogue of Borel's theorem. Although dominance does not 
make any sense in this context, it was shown by Larsen [La] that the image of the word map is large. 

Theorem 5.3. [La] For every nontrivial word w and any e > there exists N = N{w, e) such that 
if G is a finite simple group of order greater than N, then 

#w{G) > #Gl-^ 

The original proof of this theorem heavily relied on the techniques of Larsen-Pink [LP] for es- 
timating the sizes of the fibres of w in the case where G is of Lie type. The case G = An always 
requires separate consideration, and sporadic groups can be ignored whenever one restricts attention 
to asymptotic questions. 

Later on, this theorem was reproved by Larsen and Shalev [LaS2] in the case of groups of Lie 
type using more traditional methods of arithmetic geometry, such as Lefschetz's trace formula (not 
including, however, Suzuki and Ree groups, successfully treated in [LP]). 

In the same paper [LaS2], Larsen and Shalev considered some other variations on estimating the 
size of w{G). Namely, if G is a group of Lie type (different from A,, or ^A^.) of fixed Lie rank r, they 
obtained an estimate of the form 

#w{G) > cr-^#G. 
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Here c is a positive absolute constant and G is of order greater than N{'w). For the alternating 
groups the estimate is slightly weaker. 

As there is no hope to establish surjectivity of the map w for arbitrary words (power maps provide 
easy counter-examples), one can try to say something more about ui(G). The following terminology, 
reminiscent of classical number theory, was introduced by Shalev. 

5.2.1. Waring type properties. [Sh2, LaS2, LSTl, LST2, KN] 

Waring's problem deals with expressing every natural number as a sum of f{k) k*'^ powers for 
some suitable function /. Noncommutative analogues of this problem were investigated during the 
past years, answering the following questions: Can one write any element of a finite simple group as a 
product of f{k) k*'^ powers? (See [MZ], [SW].) Can this result be extended by replacing the word x'' 
with an arbitrary nontrivial word wl Can these results be improved by replacing the function / with 
a global (small) constant? See [LSTl] and the references therein, and the most recent improvements 



Using lots of various methods and ingenious techniques, Shalev showed in [Sh2] that for every 

nontrivial word w there exists a constant N{w) such that if G is a finite simple group of order greater 
than N{w) then w{G)^ = G. Two alternative proofs of this result were recently found. The first, 
due to Nikolov and Pyber [NP], is based on a recent result of Gowers [Gow], and the second, for 
finite simple groups of bounded Lie rank, due to Macpherson and Tent [MT] , relies on model theory. 
This result was substantially improved by Larsen, Shalev and Tiep in [LaSl, LaS2, LSTl]. 

Theorem 5.4. [LSTl] For any nontrivial word w there exists a constant N{w) such that for all 
finite non-abelian simple groups G of order greater than N{w) we have 



The particular case oi w = x'' shows that this is the best possible Waring type result for powers. 

Conjecture 5.5 (Shalev). [Shi, Conjectures 2.8 and 2.9] Let w ^ 1 be a word which is not a proper 
power of another word. Then there exists a number N{w) such that, if G is either An or a finite 
simple group of Lie type of rank n, where n > N{w), then w(G) = G. 

A recent result of Kassabov and Nikolov [KN] shows that the assumption in Theorem 5.4 that 
G is sufficiently large cannot be removed, even if we only require that G = w{G)'' for a fixed k. 
Indeed, it is shown in [KN] that for any integer k there exist a word w and a finite simple group G, 
such that w is not an identity in G, but G 7^ w{G)^ . This is done by constructing for any n > 13 
a specific word w ^ Ti such that w(^„) consists of the identity and all 3-cycles. The result follows 
since for n > 2A; + 1 there are elements in An which cannot be written as a product of less than 
fc + 1 3-cycles. 

Further examples of word maps on SL(2, 2") whose image is very small (consisting of the identity 
and a single conjugacy class) have been constructed in [Lei] . In a more recent preprint [Lu] , Lubotzky 
proved (assuming the classification of finite simple groups) that any given subset of a finite simple 
group G which contains the identity and is invariant under Aut(G) can arise as the image of some 
word map. In [Le2], this result was extended to some almost simple and quasisimple groups. 

The main tools in [Sh2, LaS2, LSTl] involve representation theory, algebraic geometry and proba- 
bilistic methods. For any two nontrivial words W\.,wi the rough idea is to construct special conjugacy 
classes Ci , G2 C G satisfying: 



in [GT2]. 



w{Gf = G. 



GiC«;i(G), G2C«;2(G), 



(5.3) 



and 



CiG2DG\{l}. 



(5.4) 
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The proof of (5.4) relics on the following classical result of Frobenius. Let Ci = sf, C2 = 82- 
The number of ways to write a group clement g ^ G as g = X1X2, where € C,, is given by 



where Irr(G) denotes the set of irreducible complex characters of G. 

The case of the alternating groups was established by Larsen and Shalev in [LaS2]. First, it was 
proved that for any w ^ 1 there exists N{w) such that if n > N{w) then the image w{An) contains 
the conjugacy classes of permutations s„ with a few cycles (at most 23), thus implying (5.3). This is 
highly non-elementary, involving algebraic geometry and results from analytic number theory (such 
as weak versions of the Goldbach Conjecture). The idea is to embed groups of the form SL(2,p) and 
their products into An, basing on the fact that SL(2,p) embeds into ^p+i, and an element of order 
{p — l)/2 in SL(2,p) has two nontrivial cycles and two fixed points in this embedding, and then use 
the following property of word maps on SL(2,p). 

Theorem 5.6. [LaS2, Theorem 4.1] For every nontrivial word w there exist constants and 
with the following property: for every prime p > Myj, such that p—1 is divisible neither by 4 nor by 
any prime 3 < I < rriw, w{SL{2,p)) contains an element of order {p — l)/2. 

Inclusion (5.4) can now be obtained by combining (5.5) with the fact that all character values 
x(s) of a permutation s € Sn can be bounded in terms of the number of cycles alone. 

Theorem 5.7. [LaS2, Theorem 7.2] Let s £ Sn be a permutation with k cycles {including 1-cycles). 



for all irreducible characters x of Sn- 

Larson and Shalev [LaS2] also treated the case of finite simple groups of Lie type of bounded 
rank. Later on, Larsen, Shalev and Tiep [LSTl] completed the proof for finite simple groups of Lie 
type of arbitrary rank. 

For finite simple groups of Lie type, Ci, G2 are the classes of suitable regular scmisimplc elements 
si,S2 G G lying in maximal tori Ti,T2 C G. The tori Tj are chosen so that if x is an irreducible 
character of G such that x(*i)x(*2) 7^ then x is unipotent; moreover, there is a small (in particular, 
bounded) number of such unipotent characters. These results arc obtained using the machinery of 
Deligne-Lusztig, see, e.g., [DM]. This implies that the number of nonzero summands in (5.5) is 
small, and moreover, the character ratios can be bounded: 

Theorem 5.8. [LSTl, Theorem 1.2.1] IfGis a finite quasisimple classical group overFq and g G G 
is an element of support at least N, then 



for alllG^X&MG)- 

(Here the support of g is defined as the codimension of its largest eigenspace, see [LSTl, Defini- 
tion 4.1.1]. Recall that a quasisimple group G is a perfect group such that G/Z{G) is simple.) 

The proof of (5.3) is based on geometric, tools, and in particular, on the Lang-Weil estimate, that 
allows one to establish a Chebotarev Density Theorem for word maps. 

Theorem 5.9. [LSTl, Corollary 5.3.3] For every fixed nontrivial word w and fixed integer N , there 
exists S > such that for every semisimple algebraic group G of dimension less than N over a finite 
field Fq and every maximal torus T 0/ G defined over ¥q, we have 




(5.5) 



Then 



\x{s)\ < 2^-^k\ 



|X(5)I/X(1) < q 



-^/N/A%l 



#(T(Fg)n«;(G(F,))) >5#T(F,). 
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Hence, if q is sufScicntly large, there exist rcigular scrnisimplc cilcmcmts Si € Wi(Gr{Fq)) lying in 
any prescribed maximal torus T(Fg). This itself is not enough, since the group G is of unbounded 
Lie rank. This obstacle is treated by embedding groups H of very small rank (such as SL2) over 
large extension fields into G so that Si € Wi{H) remains regular semisimplc in G, and lies in the 
required maximal torus Tj of G. Clearly Sj € Wi{G) so that Wi{G) contains the conjugacy class 
Cj = sf , as required. 

Similar Waring type results were obtained by Larsen, Shalev and Tiep in [LST2] for quasisimple 
groups. 

Theorem 5.10. [LST2] For a fixed nontrivial word w there exists a constant N{w) such that if G 

is a finite quasisimple group of order greater than N(w), then w{G)'^ = G. 

For various families of finite quasisimple groups, including covers of alternating groups, a stronger 
result was proved in [LST2], namely that w{G)'^ = G. This was recently finalized by Guralnick 
and Tiep [GT2], who proved that for any nontrivial word w there exists N = N(w) such that 
w{G)'^ I) G \ Z{G) for all quasisimple groups G of order greater than N. Note, however, that in 
contrast with the case of simple groups studied in [LSTl], the equality w(G)^ = G may not hold for 
all large finite quasisimple groups G. The nontrivial central elements of finite quasisimple groups G 
provide the main obstructions (see Sections 5.2.3 and 6.f for examples). 

Further variations on the Waring theme, discussed in [Shi] and [LaS2], include considering prod- 
ucts wi{G) . . . Wk{G) and intersections wi{G) H • • • n Wk{G) where Wi are distinct words. The latter 
case can be fit into the same framework by looking at the fibre product of the word maps : G'^ G 
over G (a different approach was suggested in [NP]). The reader is addressed to the original papers 
for details. 

Some counterparts of Waring type properties discussed above can be formulated for maps of 
matrix algebras induced by associative noncommutative polynomials, see [KBKP] for a survey. 

Remark 5.11. In [LaS3], Larsen and Shalev obtained a general estimate for the size fl^N^^g) 
of the fibres of word maps: for all w S J"'', w 7^ 1, there exists £ > such that for all finite 
simple groups G and all g G G we have #Nw{g) = 0{jj=G'^^^), where the implicit constant depends 
only on w. Naturally, these universal estimates are rough in comparison with the equidistribution 
results because they hold for all nontrivial words, including power words, which are far from being 
equidistributed, and also for all elements in the group, including 1, for which the fibre may be very 
large (as in the case of commutators). 

Remark 5.12. In a different spirit, estimates for the size of the fibres of word maps were used 
in [Ab] and [NSl], where they yielded new criteria for distinguishing finite nilpotent and solvable 
groups. (We thank the referee for pointing out the references mentioned above.) 

5.2.2. Commutators. [Or, ThR, EG4, LOSTl, L0ST3, GM] 

In this and next sections we consider the image of word maps for some special words w. First 
note that for any primitive word w (this means that w; is a part of a free generating set of J-^), as 
well as for any word of the form w = x'^ . . •x^''/, where the Cj are coprime and / belongs to the 
derived group J^^, the induced map w: G'^ — > G is surjective for an arbitrary group G (see, e.g., 
[Seg, 3.1.1]). The commutator word is the first nontrivial instance of the surjectivity problem. 

Theorem 5.13 (Ore's Gonjecture). If G is a finite non-abelian simple group, then every element 
of G is a commutator. 

In other words, for the commutator word w = [x,y\ e J2, one has w{G) = G for any finite 
non-abelian simple group G. This statement was originally posed in 1951 and proved by Ore himself 
for the alternating groups [Or]. During the years, this conjecture was proved for various families of 
finite simple groups (see [LOSTl] and the references therein). R. Thompson [ThR] established it 
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for the linear groups 'PSL{n,q). later EUers and Gordeev [EG4] proved the conjecture for all finite 
simple groups of Lie type defined over a field with more than 8 elements, and recently an impressive 
full stop was put by Liebeck, O'Brien, Shalev and Tiep [LOSTl] who completed the proof for all 
finite simple groups. 

The original proofs of Ore [Or] and R. Thompson [ThR] were obtained by explicitly finding pairs 
of permutations (respectively, matrices) whose commutator corresponded to some representative in 
any given conjugacy class. 

In order to complete the proof of Ore's Conjecture, Liebeck, O'Brien, Shalev and Tiep used in 
[LOSTl] the following classical criterion dating back to Probenius, that the number of ways to write 
an element 5 in a finite group G as a commutator is 



Roughly speaking, it was shown that if g is an element with a small centralizer, then x(.9)/x(l) 
is small for x I, and the main contribution to the character sum (5.6) comes from the trivial 
character x = 1- Hence, this sum is positive, so elements with small centralizers are commutators. 
This is based on the Dclignc Lusztig theory, and also on the theory of dual pairs and Weil characters 
of classical groups [TZ], [GTl]. For elements whose centralizers arc not small, the strategy is to 
reduce to groups of Lie type of lower dimension and use induction. Namely, if a certain element has 
a Jordan decomposition into several Jordan blocks, and if it is possible to express each block as a 
commutator in the sniallcir classical group, then clearly the original element is itself a commutator. 

Computer calculations (using MAGMA) played a significant role in the proof of [LOSTl]. Since 
the proof uses induction, it was necessary to establish various base cases. The conjecture was proved 
directly for many of these base cases by constructing the character table of the relevant group. For 
various other groups certain elements with prescribed Jordan forms as commutators were explicitly 
constructed. 

Similar methods were used in the subsequent paper [L0ST3], in which it was shown that with a 
few (small) exceptions, every element of a finite quasisimple group is a commutator, and moreover, 
any such element is a product of two commutators. 

Ellers and Gordeev [EG4] have proved, for the finite simple groups of Lie type over fields with 
more than 8 elements, a stronger conjecture, known as Thompson's Conjecture. 

Conjecture 5.14 (Thompson's Conjecture). Every finite simple group G has a conjugacy class C 

such that = G. 

Observe that Thompson's conjecture immediately implies Ore's conjecture. Indeed, if = G 
then 1 G so C^^ — C and G ~ CC^^. Hence, for any g E G there exist x G C and h G G such 
that g = x^x~^ = [h,x\, as required. 

Thompson's conjecture was verified for many families of finite simple groups, including the alter- 
nating groups and the sporadic groups, see the introduction of [EG4], but nevertheless it is still very 
much open today. 

The proof of Ellers and Gordeev is based on the following generalization of the Gauss decompo- 
sition of matrices. 

Theorem 5.15. [EG1]-[EG3] Let G be a Chevalley group, and let T be a group generated by G and 
a cyclic group (a) which normalizes G in T and acts as a diagonal automorphism on G {perhaps 
trivially) . 

Let J = ag € T \ Z{T). If h is any fixed element in the group H, then there is t € G such that 
T^T~^ = (ju\hu2, where ui G U~ and U2 G U. 




Xelrr(G) 



(5.6) 
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Here H, U and U are the subgroups of G defined by 

H={ha:aGll), U={X^:aG $+), C/" = (X„ : a e 

where $ is the root system corresponding to G and 11 denotes the simple roots of Recall that 

the Chcvallcy group G is generated by the root subgroups Xa, a (E <f>. 

As a consequence of Theorem 5.15, one easily gets a statement in the spirit of inclusion (5.4): 

Corollary 5.16. If h\,h2 G H are regular semisimple elements in G from a maximal split torus 
and Ci, C2 are the respective conjugacy classes, then 

C1C2 2 G\Z{G). 

Indeed, by [EGl, Proposition 1], for fixed hi, /12 any ui € and U2 G C/ can be represented as 

Ml = vihiv^'^h^'^ and U2 = h2^V2h2V2^ , 

for some vi e U~ and V2 S U. Thus by Theorem 5.15, for any noncentral conjugacy class C C G 
one can find a representative c G C such that 

C= Uihih2U2 = {vihiV^^h^^)hih2{h2^V2h2V2^) = ivihiVi^){v2h2V2^). 

Corollary 5.16 immediately implies Ore's conjecture for any simple group G containing a regular 
semisimple element /i in a maximal split torus, and Thompson's conjecture if this element is in 

addition real, i.e., if h and h~^ arc conjugate. In [EG4] a careful analysis is done to show that such 
desired elements actually exist in groups of Lie type over fields with more than 8 elements. 

In addition, Guralnick and Malle [GM] have extended the aforementioned result (5.4) from [LSTl] 
and proved the following variant of Thompson's conjecture. 

Theorem 5.17. [GM, Theorem 1.4] If G is a finite non-abelian simple group, then there exist 
conjugacy classes Ci, C2 in G with 

G\{l} = CiC2. 

Moreover, aside from G = PSL(2, 7) or PSL(2, 17), one can assume that each Ci consists of elements 

of order prime to 6. 

Similarly to [LSTl], the proof in [GM] also relies on estimating the character sum (5.5) using the 
Deligne-Lusztig theory, or for some small rank groups it is computed directly from known character 
tables, to show that triples {x-i,X2,g) of elements from specified conjugacy classes Ci exist in a 
given group G. The conjugacy classes Ci are chosen so that only few irreducible characters vanish 
simultaneously on these classes. These triples moreover generate G, since the conjugacy classes Cj 
were chosen so that their elements are contained in few maximal subgroups of G. 

After considering the commutator word, it is natural to go over to the Engel words, defined 
recursively by 

ei{x,y) = [x,y] = xyx^^y^^, e„(x,y) = [e„_i,y], (5.7) 

and the corresponding Engel word maps Cn'- G x G ^ G. The following conjecture is naturally 
raised. 

Conjecture 5.18 (Shalev). Let n G N, then the n*'* Engel word map is surjective for any finite 
simple non-abelian group G. 

See Section 6 for the cases G = SL(2, q) and PSL(2, q). 
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5.2.3. Two-power words. [LSTl, GM, L0ST2, LST2] 

It follows from Theorem 5.4, due to Larsen, Shalev and Tiep [LSTl], that any two-power word is 
surjective on sufficiently large finite simple groups (see [LSTl, Theorem 1.1.1 and Corollary 1.1.3]). 
More precisely: 

Theorem 5.19. [LSTl] Let a,b he f,wo nonzero integers. Then there exists a nvmher N = N(a,b) 
such that if G is a finite non-abelian simple group of order at least N, then any element in G can 
be written as x""])^ for some x,y £ G. 

Furthermore, by recent results of Liebeck, O'Brien, Shalev and Tiep [L0ST2] and of Guralnick 
and Malle [GM] (see Theorem 5.17), some words of the form x^y'' are known to be surjective on all 

finite simple groups. 

Theorem 5.20. [GM, Corollary 1.5] Let G be a finite non-abelian simple group and let b be either 
a prime power or a power of 6. Then any element in G can be written as x^y^ for some x,y G G. 

Note that in general, the word x^y^ is not necessarily surjective on all finite simple groups. Indeed, 
if b is a multiple of the exponent of G then necessarily x^y^ = 1 for every x, y G G. 

In addition, by another recent work of Larsen, Shalev and Tiep [LST2] , if G is a finite quasisimple 
group, then the word w = x^y"^ is surjective on G. On the other hand, if 6 > 2 then the word w = x^y^ 
is not surjective on infinitely many finite quasisimple groups. 

5.3. The fibres of the word map. Recall that estimating the sizes of the fibres of the word map 
appeared as an integral part of estimating its image. In this section we address a subtler problem 
trying to distinguish words for which the fibres of the corresponding word map are of the same size, 
at least approximately. First note that for certain words w all fibres of the word map w: G''^ ^ G are 
exactly of the same size for any finite group G. According to recent results of [Pu] , [PP] , this holds 
only for primitive words. Primitive words are asymptotically very rare (exponentially negligible, in 
the terminology of [KS]): if we count them among all words of fixed length, their proportion tends 
to exponentially fast (see, e.g., [MS]). Another viewpoint at the set of primitive elements oi is 
that this set is closed in the profinite topology of Fd [PP]- 

We are interested in weaker equidistribution properties which hold for more general words. 

5.3.1. Equidistribution and measure-preservation. [GS, L0ST3] 

For a word w = w{xi, . . . , Xd) € Fd, a finite group G and some g G G, we denote 

Nwig) = {{gi,...,gd) e G'^ w{gi, . . . ,gd) = g}. 

Definition 5.21. A word map w: G'^ ^ G is almost equidistributed for a family of finite groups Q 
if any group G e Q contains a subset S = So G with the following properties: 

(z) #5 = #G(1 - e(G)), 

(ii) #N^{g) = i#GY~\l + e(G)) uniformly for all g G S, 
where e(G) whenever #G — >■ oo. 

Theorem 5.22. [GS, Theorem 1.5] The commutator word w = [x,y] G F2 is almost equidistributed 
for the family of finite simple groups. 

Note that we cannot require in this theorem that S = G. Indeed, it is well known (and follows 
from (5.6) above) that for w = [x,y] we have 

N^{1) = k{G)#G 

where k{G) is the number of conjugacy classes in G. Since k{G) — )• 00 as #G — )• 00 we see that the 
fibre above g = 1 is large and does not satisfy condition (ii). 

Two proofs are given in [GS] for this theorem. The first is probabilistic whereas in the second 
the subsets S are explicitly constructed. 
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Let P ~ P'^ be the commutator distribution on G, namely P{g) = Nyj{g)/^G'^ , and let U = U'^ 
be the uniform distribution on G (so U{g) = 1/ #G for all g £ G). The probabilistic proof bounds 
the Li-distance 

\\P-U\U = J2\P{g)-U{g)\ 

geG 

between the probability measures above. Using Frobenius formula (5.6) and the Cauchy-Schwarz 
inequality, it is deduced in [GS, Proposition 1.1] that 

\\P^-U^\\i < {C^{2)-iy/\ 

where 

Xelrr(G) 

is the so-called Witten zeta function. 

Now Theorem 5.22 follows from results of Liebeck and Shalev [LiS2], who showed that if G is 
simple, then 

C'^{s) -J> 1 as #G -> oo provided s > 1. 

This proof also provides some estimation of the function e. Namely, e(A„) = 0{n^^^^), and if G is 
of Lie type of rank r over a field with q elements then e(G) = 0{q~^^^). 

The second, constructive proof describes the subsets S explicitly. If G = PSL(2,g) then Theo- 
rem 5.22 follows from (5.6) directly using the well-known character table of G. If G / PSL(2, q) 
is a group of Lie type of bounded Lie rank then S — Sq is chosen as the set of regular scmisimple 
elements of G. If G is a group of Lie type with unbounded Lie rank, then S = Sg contains elements 
whose centralizer is not very large. If G = ^„ then S = Sg is chosen as the set of permutations 
in An with at most ^/n fixed points. This yields better lower bounds on the cardinality of S. For 
example, in the constructive proof for An we obtain #5" > (1 — 2/[\/n]!)#A„, which is much better 
than the lower bound (1 — 0{n~^^'^))4^An given by the probabilistic proof. 

Similarly, it was shown in [L0ST3, Proposition 3] that the commutator word is also almost 
equidistributed on the family of finite quasisimple groups. 

In [GS, Section 7] it is proved that the property to be almost equidistributed behaves well under 
direct products and compositions, implying that the words w = [xi, . . . , xj] G Td, rf-fold commu- 
tators in any arrangement of brackets, arc almost equidistributed within the family of finite simple 
groups. Similar methods are used in [GS, Theorem 7.1] to show that the word w = x^y^ is almost 
equidistributed on finite simple groups, and Larsen and Shalev have recently obtained similar results 
in more general contexts. 

By [GS, Section 3], any "almost equidistributed" word map w: G'' — >■ G (see Definition 5.21) is 
also "almost measure preserving" in the following sense. 

Definition 5.23. A word map w: G'' — >■ G is almost measure preserving for a family of finite groups 
Q if every group G £ Q satisfies the following conditions: 

(i) for every subset y C G we have 

#w-\Y)li^G'' = W/*G + 0(1); 

(ii) for every subset X C G'' we have 

#w;(X)/#G > #X/#G'^ - 0(1); 

(iii) in particular, if X C G^ and #X/#G'^ = 1 — o(l), then almost every element g £ G can be 
written as g = w{gi, ...,gd) where (51, . . . , 5d) e X; 

here o(l) denotes a function depending only on G which tends to zero as #G 00. 
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This allows one to deduce in [GS, Corollary 1.6] that the commutator map is almost measure 
preserving on the family of finite simple groups. Since almost all pairs of elements of a finite simple 
group are generating pairs (see [Di], [KL], [LiSl]), the probability that some g G G can be represented 
as a commutator g = [.r.y], where x,y generate G, tends to 1 as ^G — > oo, by [GS, Theorem 1.7]. 

It seems to be not so easy to extend equidistribution results from commutators to more general 
words. See, however, the next section where this is done in the particular cases G = SL(2, q) and 
PSL(2,g). 



6. Word maps on SL(2,g) and PSL(2,g) 

6.1. Surjectivity for Engel words and some positive words. In [BGG] the particular ease of 
Engel words in the groups PSL(2, q) and SL(2, q) was analyzed using the trace map method, in an 
attempt to prove Conjecture 5.18 for PSL(2, q). The main idea was to check the surjectivity of the 
trace map on A'^(Fg) instead of the surjectivity of the word map on SL{2,q). 

For any x,y & G = SL(2, g) denote s = tr(x), t = tr(y) and u = tr(a;y), and define a morphism 
tt: G X G ^ ^l,u,t by 7r(a;, y) := (s, u, t). Then the following diagram commutes: 

GxG ^ GxG 

TT 4- 4- 

In this diagram, the maps and ip"' are defined recursively as follows. 

• Hx,y) = {[x,y],y) =^ 

%l){s, u,t) = (si, t, t), Si = + + — sut — 2; 

• 4''^{x,y) = i[[x,y],y],y) =^ 

llP'{s,U,t) = ll){si,t,t) = (S2,t,t), 

S2 = r{sut) = sl + 2t^ - sit^ -2; 

• (t>^{x,y) = {en{x,y),y) =^ 

■tlj"'{s,U,t) =tp"'-'^{si,U,t) = ■■■ = V(Sn-l,t,i) = {Sn,t,t), 

Sn = r{sn-ut) = r{r{sn-2,t),t) = ... =r("-i)(si,i), 

where r(s, t) := + 2t^ - st^ - 2. 

Let ±2 7^ a e Fg. It is proven in [BGG] that a matrix z € SL(2, q) with tr(2;) = a can be written 
as 2; = e„+i(a;, y) for some x,y & SL(2, q) if and only if there is a solution {si,t) G of the equation 
A"'\si,t) = a, that is if and only if the curve G„,o defined over Fg by the equation r^'^\si,t) = a 
has a rational point. 

In [BGG] it was shown that the curve Cn,a is absolutely irreducible over any finite field Fg. In 

addition, its genus satisfies the inequality g{Cn,a) < 2"(n — 1) + 1, and it has at most (5 = 5-2" 
punctures. By Weil's inequality, for a ^ ±2 and q > 2^""'"3(n— 1)^ we have Cn^a^q) 7^ 0) as required. 

This implies the following results, obtained in [BGG]. 

Theorem 6.1. The n*'' Engel word map is surjective on SL(2, q) \ {— id} {and hence on PSL(2, q)) 
provided that q > qo{n) is sufficiently large. 

On the other hand. 

Proposition 6.2. There is an infinite family of finite fields Fg such that if n > no{q) is large 
enough, then the n*^ Engel word map is not surjective on SL(2, q) \ {— id}. 



24 



TATIANA BANDMAN, SHELLY GARION, BORIS KUNYAVSKII 



Proposition 6.3. For every odd prime power q there is no = no{q) such that en{x,y) ^ — id for 

every n > uq and every x,y Cz SL(2, q). 

Indeed, z = en+i{x,y) = — id implies tliat tliere is a solution to the equation r^"\si,0) = —2, 
and then there is c e Fg2 such that = — 1. 

In certain cases, e„ is always surjective on PSL(2,g'). 

Proposition 6.4. e„ is surjective on SL(2,2®) = PSL(2,2*). 

Indeed, ii q = 2^, take t = and then r(s, 0) = s^, so r"(s, 0) = s^" is an isomorphism, implying 
that e„ is surjective on PSL(2,q). 

Proposition 6.5. e„ is surjective on PSL(2,(7) if € F^, -y/— 1 ^ ¥q. 
Proposition 6.6. If n<A then e„ is surjective for all groups PSL(2,(7). 
The last result is a consequence of Theorem 6.1 and MAGMA calculations. 

In [BG] there is a precise description of the positive integers a, b and prime powers q for which 
the word map w{x, y) = x^y^ is surjective on the group PSL(2, q) (and SL(2, q)). The proof is based 

on the investigation of the trace map of positive words. 

The key result is that ti{x°'y^) is a linear polynomial in u, namely: 

tr(a;"y'') = u ■ fa,b{s, t) + ha,b{s, t), 

where fa,b{s,t),ha,b{s,t) € Fg[s,t]. 

Thus, if neither a nor b is divisible by the exponent of PSL{2,q), then any element in F, can 
be written as tr{x°-y^) for some x,y E SL(2,<7). This immediately implies that in this case, any 
semisimple element (namely, z £ SL(2, q) with tr(2;) ^ ±2) can be written as 2: = x'^y^ for some 
x,y G SL(2, q). However, when z is unipotent (namely, z 7^ ± id and tr(.2) = ±2) one has to be more 
careful, and a detailed analysis is needed. Indeed, it may happen that neither a nor b is divisible by 
the exponent of PSL(2, g), but nevertheless the image of the word map w = x°'y^ does not contain 
any unipotent. For example, the word w = x^^y^^ is not surjective on PSL(2,7) and PSL(2,8). 

In addition, it was determined when — id can be written as for some x,y E SL(2,g). In 

particular, if g = ±3 mod 8, then x'^y'^ ^ — id for every x,y £ SL(2, q) (the same result was obtained 
independently in [LST2]). 

These results demonstrate, in particular, the difference between word maps in simple and qua- 
sisimple groups (see also the previous discussion in Sections 5.2.1, 5.2.2 and 5.2.3). 

6.2. Criteria for equidistribution. In this section we describe some results on equidistribution 
of solutions of word equations of the form w{x, y) = g in the family of finite groups SL(2, q) which 
were obtained in [BK]. A criterion for equidistribution in terms of the trace polynomial of w is 
given in Theorem 6.18 below. This allows one to get an explicit description of certain classes 
of words possessing the equidistribution property and show that this property is generic within 
these classes. This result can be viewed, on the one hand, as a refinement (in the SL2-case) of 
equidistribution theorems of [LP] and [LSTl] on general words w and general Chevalley groups G, 
and, on the other hand, as a generalization of equidistribution theorems for some particular words: 
[GS] (commutator words on any finite simple G), [BGG] (Engel words on SL2), [BG] (words of the 
form w = x°-y^ on SL2). Acting in the spirit of [GS], we deduce a criterion for w: SL2 x SL2 SL2 
to be almost measure-preserving. It turns out that "good" (equidistributed, measure-preserving) 
words are essentially those whose trace polynomial cannot be represented as a composition of two 
other polynomials. 

Here are precise definitions and results. We will follow the approach to equidistribution adopted 
in [GS] (see Section 5.3.1 above). 
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Definition 6.7. (cf. [GS, §3] and Section 5.3.1) Let f : X ^ Y he & map between finite non-empty 
sets, and let £ > 0. We say that / is e-equidistributed if there exists Y' CY such that 

(i) #Y' > #Y{1 - s); 

(ii) m-'{y) - |#| < £|# for all y € Y'. 

The setting is as follows. Let a family of maps of finite sets Pq-. Xq ^ Yq he given for every 
q = p^. Assume that for all sufficiently large q the set Yq is non-empty. For each such q take y &Yq 
and denote 

Py = {xeXq:Pq{x)=y}. 

Definition 6.8. Fix a prime p. With the notation as above, we say that the family Pq : Xq Yq, 

q — , is p-equidistributed if there exist a positive integer no and a function : N — > N tending to 
as n — >■ 00 such that for all q = p" with n> tiq the set Yq contains a subset Sq with the following 
properties: 

(i) #5, < (#yj; 

(ii) |#P^ - f^l < for all yeYq\ Sq. 

Remark 6.9. Definition 6.8 means that for q = p^ large enough, the map Xq Yq \s £p{q)- 
equidistributed, in the sense of Definition 6.7. 

Definition 6.10. We say that the family Pq-. Xq ^ Yq is equidistributed if it is p-equidistributed 
for all p and there exists a function e : N ^ N tending to as n oo such that for every p and 
every q = p'^ large enough, we have £p{q) < s{q)- 

Let us now consider the case where Yq = Gq = SL(2, q). Xq = {GqY is a direct product of its two 
copies, and Pq = Pw,q ■ {GqY — ^ Gq is the morphism induced by some fixed word w € F^- 

Accordingly, we say that w is equidistributed (or p-equidistributed) if so is the family of maps 
Pw,q ■ SL(2, q) X SL(2, q) SL(2, q) (or, in other words, if so is the morphism : SL2,z x SL2,z — ^ 
SL2,z of group schemes over Z) . 

Recall some properties of polynomials. 

Definition 6.11. Let F be a finite field. We say that h G ¥[x] is a permutation polynomial if the 
set of its values {h{z)}ze¥ coincides with F. 

Theorem 6.12. [LN, Theorem 7.14] Let q = p". A polynomial h G Fg[a;] is a permutation polyno- 

k 

mial of all finite extensions of F, if and only if h = ax^ + b, where a ^ and k is a non-negative 

integer. 

The following notions are essential for our criteria. 

Definition 6.13. Let be a field. We say that a polynomial P G F[xi, . . . ,Xn] is F-composite if 
there exist Q G F[xi, . . . , a;„], degQ > 1, and h G F[z], degh > 2, such that P = hoQ. Otherwise, 
we say that P is F-noncomposite. 

Note that if E/F is a separable field extension, it is known [AP, Theorem 1 and Proposition 1] 
that P is F-composite if and only if P is F-composite. In particular, working over perfect ground 
fields, we may always assume, if needed, that F is algebraically closed. 

Definition 6.14. Let P G 'L\x\, . . . , a;„]. Fix a prime p. 

• We say that P is p-composite if the reduced polynomial Pp G ¥p[xi, . . . , Xn] is Fp-composite. 
Otherwise, we say that P is p-noncomposite. 

• We say that a p-composite polynomial P is p-special if, in the notation of Definition 6.13, 
Pp = hoQ where h G ¥p[x] is a permutation polynomial of all finite extensions of Fp. 
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Definition 6.15. Wc say that a polynomial P € Z[a;i, . . . ,.t„] is almost noncomposite if for every 
prime p it is cither p-noncomposite or p-special. Otherwise we say that P is very com,posite. 

Remark 6.16. If a polynomial P G Z[a;i, . . . is Q- noncomposite, it is p-noncomposite for all 
but finitely many primes p [BDN, 2.2.1]. If P e Z[a;i, . . . , a;„] is Q-composite, it is very composite. 

We can now formulate the main results of [BK]. 

Theorem 6.17. Let w & 7^2- The morphism : SL2,z x SL2,z — >■ SL2,z is p-equidistributed if and 
only if the trace polynomial f^, is either p-noncomposite or p-special. 

Theorem 6.18. Let w G 7^2- The morphism : SL2,z x SL2,z — ^ SL2,z is equidistributed if and 
only if the trace polynomial fw is almost noncomposite. 

Corollary 6.19. Suppose that for each p and all n big enough the image of Pw,p"'- SL(2,p") x 
SL(2,p") SL(2,p") contains all noncentral semisimple elements o/SL(2,p"). Then w is equidis- 
tributed. 

For a given word w G 7^2, let us now consider the family of groups Gq = PSL(2, g) and the 
corresponding word maps Wq-. Gq x Gq Gq. 

Proposition 6.20. If the morphism P^: SL2,z x SL2,z SL2,z is equidistributed {or p-equidistri- 
buted), then so is the family of maps Wq-. Gq x Gq ^ Gq. 

Here are the main ingredients of the proofs of Theorems 6.17 and 6.18: 

• Diagram 

Gq X Gq > Gq 

tr (6.1) 

A?,„,t(F,) — ^ Ai(F,) 
where Tr{x,y) = (tr(a;), tr(a;j/), tr(j/)). 

• An explicit Lang-Weil estimate (Ghorpade-Lachaud [GLl]): if fl" c is an absolutely 
irreducible hypersurface of degree d, then 

|#F(F,) -q^\<(d-l){d- 2)g3/2 + i2(d + 4)4g 

or, equivalently, #iJ(Fg) — q'^{l + n) with 

< q-'/^[{d - l){d - 2) + 12(d+ 4) V'/']. 

For q > Gd^ this gives |ri| < 1/2. 

• A generalized Stein-Lorenzini inequality [Na]: if fw,p is p-noncomposite, then the spectrum 
f(/t(;,p), i.e., the set of all points z e Al{¥p) such that the hypersurface C A^„((Fp), 
defined by the equation fyj{s,u,t) = z, is reducible, contains at most d — 1 points, where 
d = deg/^. The same is true for each aq{fw) := a{fw,p)r\Vq. Let z e A^(Fp) \ a{fw^p). 
Then is an irreducible hypersurface and hence satisfies the Ghorpade-Lachaud inequality. 

• Estimates for fibres of the trace map: 

Lemma 6.21. Let D{s,u,t) = {s^ - 4){t'^ - 4){s^ + 1'^ + u"^ - ust - 4) , and let A C A^^^ j be 
defined by the equation D = 0. Let H c Ag„ ^(Fp) be a hypersurface of degree d such that 
^ A. Then for tt from diagram (6.1) we have #n-'^{H){¥q) = i^H{¥q)q^{l + r2), where 

\r2\<Cd/q. 

• Estimates for the size of the value set of polynomials [Wa]: if R is not a permutation 
polynomial for F,, g = p", then it is not a permutation polynomial for any extension F^m 
of Fg and omits at least (g™ — l)/rfi values of F^m, where di = degR. 
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6.2.1. Composite trace polynomials. The goal of this section is to describe words in two variables 
whose trace polynomial is composite. We send the curious reader to [BK] for proofs. 

Throughout this section T„(x) stands for the n*'* Chebyshev polynomial, and Dn{x) = 2T„(x/2) 
for the v}^ Dickson polynomial. It is well known (sec, e.g., [LMT, (2.2)]) that this polynomial 
satisfies Dn{x + l/x) = x" + and is completely determined by this functional equation. 

We always assume that ■w{x, y) is written in the form 

u; = x"i2/^^..x"'-y'''- (6.2) 

and is reduced (all integers Oj, bj are nonzero). We call the number r the complexity of w. 

Definition 6.22. We say that two reduced words w = x°-^y^^ . . . .x"*^ and v = x'^^y'^^ . . . x'^^'y'^'-' , 
written in form (6.2), are trace-similar if r = r' , the array {\ai\} is a rearrangement of {|cj|}, and 
the array {\bi\} is a rearrangement of 

Note that if reduced words w = x'^^y^^ . . . x'^'~y^^ and v = x'^^y'^^ . . . x'^^'y'^^' , written in form (6.2), 
have the same trace polynomial, then w and v are trace-similar [Ho]. 
The following propositions are valid (see [BK] for the proofs). 

Proposition 6.23. Let w{x, y) = x°-^y^^ . . . x°-''y^'' , A = Y^ai, B = Y^bi. Assume that either A ^ 
or B ^{S. Assume that the trace polynomial fw{s,u,t) is C-composite, fw{s,u,t) = h(q(s,u,t)), 
where q G C[s, u, t] and h G C[z], deg h>2. Then h = Dd{z) for some d>2. 

Proposition 6.24. Let w be a reduced word of complexity r written in form. (6.2). // its trace 

polynomial fy^, is C-composite, fy^,{s,2i,t) = h{q{s,u,t)) where q G C[s,u,t] and h{x) = fix" + . . . is 
a polynomial in one variable of degree n, then r = nm and w is trace-similar to v{x,y)" where v is 
a word of complexity m. 

Proposition 6.25. Let w{x, y) = x'^y^ ... be a reduced word of complexity n such that fw{s, u, t) = 
Dn{q{s,u,t)) for some q. Then w{x,y) = {x°'y^)'^. 

Remark 6.26. The statements of Propositions 6.23, 6.24 and 6.25 remain valid if we replace C by 
(the algebraic closure of) a sufiiciently big prime field Fp, and "composite" by "p-composite" (p > po 

depending on w). 

Here are some concrete cases where one can get more conclusive results. 

Corollary 6.27. Let w{x, y) = x°'y^ ... be a reduced word of prime complexity r. Ifp > r and w is 

not p-equidistributed, then w = v{x,yY. 

Corollary 6.28. The word w{x,y) = x°'y^x'^y'^ is either equidistributed or equal to {x°'y^)'^ . 

All facts mentioned above allow one to describe a class of words within which a "generic" word 
induces the map which is almost equidistributed. More precisely, we have the following proposition. 

Proposition 6.29. Let TZ be the set of words w of prime complexity. Then the set S of words 
w gTZ, such that the corresponding word morphism : SL2,z x SL2,z — )■ SL2,z is p-equidistributed 
for all but finitely many primes p, is exponentially generic in TZ. 

(According to the terminology of [KS], this means that the proportion of words from S among 
all words from TZ of fixed length tends to 1 exponentially fast as the length tends to infinity.) 

This is proved by combining the results quoted above with the well-known fact (see, e.g., [AO]) 
stating that the class of words which are proper powers of other words is exponentially negligible. 
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7. Concluding remarks and open problems 

Wc conclude with a brief disciission of various ramifications, analogues and generalizations of 
results presented in this survey, focusing on open problems, the list of which does not pretend to be 
comprehensive and reflects the authors' taste. Most of them are borrowed from [BBGKP] , [GKP] , 
[BGKP], [EGG] and [BK]. 

Engel-like words and solvability properties. In light of Theorems 3.4 and 3.6 and subsequent dis- 
cussions in Section 4.1.4, it is natural to ask whether or not the property of a given sequence to 
characterize finite solvable groups is generic. A possible way to express it is the following conjecture. 

Conjecture 7.1. LetTZo (resp. TZ) denote the class of words f{x,y,z) € satisfying the following 
condition: there exists wo{x,y) G J-2 {resp. w) such that the sequence {vn \x,y)} {resp. {vn{x,y)}) 
generated by the first word wq {resp. w) and law f 

(i) does not contain the identity word, 

(ii) is Engel-like, 

(iii) descends along the derived series 

{resp. conditions (i)-(iii) and the additional one: 

(iv) for any finite group G the following holds: G is solvable if and only if there is n such that 
Vn{x,y) = 1 in G). 

Then the class TZ is generic within TZq {in the sense of [KS], as in Section 6.2). 

We hope that algebraic-geometric approaches developed in Section 4.1.4 could be useful in estab- 
lishing this conjecture. One can also try the following counter-part as a testing ground: in Conjecture 
7.1, replace throughout "solvable" with nilpotent and "derived scries" with "lower central scries". 

It would bo interesting to investigate further, in the spirit of [GKNP] and [BBGKP], relation- 
ship between finite groups and finite-dimensional Lie algebras from the point of view of solvability 
properties. Namely, one can put forward the following (maybe over-optimistic) conjecture. 

Conjecture 7.2. Let {vn{x,y)} be an Engel-like sequence of words in the free Lie algebra W2 which 

characterizes finite- dimensional solvable Lie algebras defined over fields of arbitrary characteristics 
{i.e., a finite- dimensional Lie algebra g defined over an arbitrary field K is solvable if and only if 
there is n such that Vn{x, y) = in g). Then the same sequence, regarded as a sequence of words in 
the free group Ti {i.e., viewing Lie bracket as commutator), characterizes finite solvable groups. 

Note that the assumption on arbitrary characteristics is essential: in [BBGKP] there have been 
exhibited sequences characterizing finite-dimensional Lie algebras defined over fields of characteristic 
zero but not over fields of prime characteristic (and not characterizing finite solvable groups). The 
first intriguing example is the sequence defined by 

vi{x,y) = [x,y], Vn+i{v,y) = [[vn{x,y),x],[vn{x,y),y]]. 

This example resists the algebraic-gc^ornctric approach described in Section 3 for pin-cly computa- 
tional reasons: the arising equations lead to varieties which are out of range of SINGULAR and 
MAGMA. Perhaps experts in computer algebra, who are able to apply more sophisticated methods, 
will be more lucky. 

It is desirable to use Engel-like sequence for characterizing the solvable radical of a finite group in 

the same way as the original Engel sequence is used to characterize the nilpotent radical. A theorem 
of Baer [Ba] says that the nilpotent radical of a finite group G coincides with the set of elements 
y G G with the following property: for every x G G there is n = n{x,y) such that en{x,y) = 1. 
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Problem 7.3. Exhibit an Engel-like sequence {w„(a;,y)} such that the solvable radical of any finite 
group G coincides with the collection of y € G with the property: for every x £ G there is n = n{x, y) 
such that Vn{x,y) = 1. 

A recent result of J. Wilson [Wi], stating the existence of a two- variable countable set of words 
with the required property, gives a strong evidence that such an Engel-like sequence should exist 
but does not provide any candidate. Note that even the toy problem of characterizing the solvable 
radical of a finite-dimensional Lie algebra with the help of Engel-like sequences remains open in the 
case where the ground field is of positive characteristic; sec [BBGKP] and [GKP]. We hope that 
algebraic-geometric machinery in the spirit of Section 3 may turn out to be useful for achieving this 
goal. 

Borel's theorem and around. Let G be a connected semisimple algebraic group defined over an 
infinite field k. 

Problem 7.4. What can be said about the "fine structure" ofiv{G)7 In particular, describe w such 
that w{G) contains the set of all semisimple elements of G. 

A similar problem was discussed in [KBMR] for associative noncommutativc polynomials on 
associative matrix algebras, in the spirit of Kaplansky's problem. In such a setting, the resulting 
maps may well be non-dominant, and not only for obvious reasons mentioned in the introduction. 
In the paper cited above there have been described certain classes of polynomials which arc not 
central and whose image contains elements with nonzero trace but the induced map is not dominant 
on 2 X 2-matrices. Here are some general questions remaining open: 

Question 7.5. [KBMR] Let P be an associative, noncommutative, noncentral, multilinear polyno- 
mial in d variables whose image contains matrices with nonzero trace. Does P induce a dominant 
map M{n,K)'^ — >■ M{n,K)l Does there exist P such that this map is not surjcctive? 

Similar problems were discussed in [BGKP] for Lie polynomials P on Chevalley Lie algebras q. 
It was shown that the induced map is dominant provided P is not identically zero on st(2). We do 
not know whether or not the latter assumption can be dropped: 

Question 7.6. Does there exist a Lie polynomial in d variables, not identically zero on a Chevalley 
Lie algebra g, such that the induced map g'^ — >■ g is not dominant? 

It would be interesting to understand the situation with infinite- dimensional simple Lie algebras 
(as well as with finite-dimensional algebras of Cartan type over fields of positive characteristic) . The 
first question, which does not seem too complicated, is the following one: 

Question 7.7. For which Lie algebras q of Cartan type the map fl x g — )• g, {x,y) ^ [x,y], is 
surjective? 

If for some g this question is answered in the affirmative, one can continue by looking at the Engel 

maps, as in [BGKP]. 

Remark 7.8. It would be interesting to consider a more general set-up when we have a polynomial 
map P: L''- — >■ L*. (This means that we consider systems of equations rather than single equations.) 
In [GR] some dominance results were obtained for the multiple commutator map P: L x L'^ ^ L*^ 
given by the formula P{X, Xi, . . . , X^) = {[X, X^], . . . , [X, Xa]). 

Remark 7.9. In a similar spirit, one can consider generalized word maps w: G''' ^ G^ on simple 
groups. Apart from [GR], see also a discussion of a particular case w = {wx,W2)'- G^ G^ in 
[BGGT, Problem 1]. 
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Remark 7.10. It would be interesting to find more classes of infinite simple (or close to simple) 
groups admitting some analogue of Borel's statement in the sense that the image of the word map 
is "large" , at least for a generic word. Some such classes were discussed in the literature: infinite 
symmetric groups [Ly] (surjectivity of the commutator word was established by Ore [Or]), groups 
of automorphisms of trees [Mar] and of random graphs [DT]. 

We would suggest looking at Cremona groups, which share many common properties with linear 
algebraic groups and in which such notions as Zariski topology and dominance can be defined (see, 
e.g., [Ser]). 

Problem 7.11. Let w £ J^^he a, nontrivial word. Is the corresponding word map w: {Cr{n, k)Y — >■ 
Cr{n,k) dominant? 

Of course, the first case to be considered is n = 2. Recall two recent spectacular results on 
Cr(2, k) answering long-standing questions on its simplicity. It turns out that this group is at the 
same time simple and non-simple: it does not contain nontrivial normal subgroups closed in the 
topology mentioned above [Bl] but contains lots of abstract normal subgroups [CL]. 

Remark 7.12. In light of the previous remark, one can note that analogues of Borel's theorem may 

look differently at the first glance when one considers other classes of groups. Say, a certain analogue 
of Borel's theorem for profinite groups is provided by the following deep theorem by Nikolov and 
Segal [NS2] , [NS3] : let G be a finitely generated profinite group, let w be a non-commutator word, 
and let {w{G)) denote the corresponding verbal subgroup (i.e., the subgroup generated by the set 
{w{g)^^},g e G); then {'w{G)) is open in G. The original proof in [NS2] went through the solution 
of the restricted Burnside problem [Ze]. An alternative proof in [NS3] does not rely on Zelmanov's 
theorem (and, as pointed out to us by the referee, implies the statement of the restricted Burnside 
problem). 

Remark 7.13. To prevent the reader from an overoptimistic view on Borel's theorem, one has to 
note that there may be a significant gap between dominance and surjectivity. Moreover, the image 
of the word map may be very large in the Zariski topology (according to Borel's theorem) but very 
small in some natural topology. See [ThA] where such word maps are constructed for real compact 
Lie groups. Note that the behaviour of Engel word maps on such groups is much better: they are 
all surjective [ET]. 

Word maps on finite simple groups. Regarding the image of the word map, we can recall here 
Thompson's and Shalev's Conjectures 5.14, 5.5 and 5.18 discussed in Section 5.2.2. In particular, 
one can ask whether the following variant of Shalev's Conjecture 5.5, for the family of groups 
PSL(2,g), holds: 

Conjecture 7.14 (Shalev). ^ Assume that w = w{x,y) e T2 is not of the form v{x, y)™ for some 
V = v{x,y) e J^2 and m > 1. Then there exists a constant qo{w) such that if q > qoiw) then 
w{G)=G forG = PSL{2,q). 

In particular, this conjecture holds for Engel words and for words of the form (see Section 
6.1 and [BGG], [BG]). One can therefore attempt to use the trace map method described in Section 
2.3 to find more words satisfying the above conjecture. In particular, the following questions are 
raised (see Section 4.1.4 and [BGG, Section 8]). 

Question 7.15. [BGG] What arc the words w = w{x,y) G J-2 for which the corresponding trace 

map ipi-s, u, t) ~ (/i(s, u, t), /2(s, m, t), t) has one of the following properties: 

{■k) the set {/i(s, u, t) — a] is absolutely irreducible for almost all q and for every a G F^? 



After the first version of the present survey had been submitted, a counter-example to this conjecture was 
constructed in [JLO]. 
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there exists a ?/;-invariant plane A and the curves {V' |^= a} are absolutely irreducible for 

for almost all q and for a general a G Fg? 

It is tempting to generalize the results on the image of some word maps of [BGG] , [BG] , presented 
in Section 6.1, as well as the criteria for almost equidistribution of [BK], presented in Section 6.2, 
in the following directions: 

(i) extend them from words in two letters to words in d letters, d > 2; 

(ii) keep d = 2 but consider arbitrary finite Chevalley groups; 

(iii) combine (i) and (ii). 

Whereas in case (i) one can still hope to use trace polynomials, which exist for any d, to produce 
criteria for almost equidistribution, cases (ii) and (iii) require some new terms for formulating such 
criteria and new tools for proving them. 

Regardless of getting such criteria, it would be interesting to compare, in the general case, the 
properties of having large image and being equidistributed, in the spirit of Corollary 6.19. We dare 
to formulate the following conjecture. 

Conjecture 7.16. For a fixed p, let Gq he a family of Chevalley groups of fixed Lie type over Fg 
[q = p" varies). For a fixed word w € Td, d> 2, let Pg = Pw,q- {GqY — ^ Gq be the corresponding 
map. Suppose that 

(**) for all n big enough the image of Pq contains all regular semisimple elements of Gq. 
Then the family {Pq} is almost p- equidistributed. 

It is a challenging task to describe the words w satisfying condition (**) in Conjecture 7.16 (cf. 
the discussion in [LSTl] after Theorem 5.3.2). Certainly, words of the form w = v'^, k > 2, do not 
satisfy this condition. We do not know any non-power word for which (**) does not hold. 

Other interesting problems, arising in the context of measure preservation and primitivity, were 
raised by Puder and Parzanchevski [PP] . Namely, in [PP] it was shown that the property of a word 
w to be measure-preserving within the class of all finite groups can be detected on the family of all 
symmetric groups Sn- Arc there other natural families (say. PGL(ri, g)) that can be used as such 
detectors? They also ask whether their results on measure preservation can be extended to the class 
of compact Lie groups (with respect to the Haar measure). 

One can try yet another direction: consider equidistribution problems for matrix algebras and for 
polynomials more general than word polynomials (see Introduction). Even the case of 2 x 2-matrices 
is completely open. 

Miscellaneous remarks and problems. 

Remark 7.17. Recently, Shalev with his collaborators extended many results of Waring type from 
finite simple groups to some simple algebraic groups over p-adic integers. The case of simple Cheval- 
ley groups over rings of integers remains widely open, see [Sh3] for some relevant questions and 
conjectures. 

Remark 7.18. A new dynamical viewpoint on the image of the word map was developed by Schul 
and Shalev [SS]. They showed that the random walk on any finite simple group G, with respect to 
this image as a generating set, has mixing time 2. 

Remark 7.19. One could try to extend some of the results of this survey to the case of matrix 
groups or algebras over some sufficiently good ring. One has to be careful: say, in [RR] there are 
examples of rings R such that not every element of sl{n, R) is a commutator. 

Remark 7.20. One can ask questions of Borel's type for other classes of algebras (beyond groups. 
Lie algebras and associative algebras). The interested reader may refer to [Gor] for the case of values 
of commutators and associators on alternative and Jordan algebras. 
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Remark 7.21. It would be interesting to extend (at least part of) the methods and results doseribed 
in the present survey to matrix equations with constant matrix coefficients, as discussed in the 
introduction. 
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